K-Means Based Approach for OLAP Dimension Updates
نویسنده
چکیده
Actual data warehouses models usually consider OLAP dimensions as static entities. However, in practice, structural changes of dimensions schema are often necessary to adapt the multidimensional database to changing requirements. This paper presents a new structural update operator for OLAP dimensions. This operator can create a new level to which, a pre-existent level in an OLAP dimension hierarchy rolls up. To define the domain of the new level and the aggregation function from an existing level to the new level, our operator classifies all instances of an existing level into k clusters with the k-means clustering algorithm. To choose features for k-means clustering, we propose two solutions. The first solution uses descriptors of the pre-existent level in its dimension table while the second one proposes to describe the new level by measures attributes in the fact table. Moreover, we carried out some experimentations within Oracle 10 g DBMS which validated the relevance of our approach.
منابع مشابه
Supporting Dimension Updates in an OLAP Server
Commercial OLAP systems usually treat OLAP dimensions as static entities. In practice, dimension updates are often needed to adapt the warehouse to changing requirements. In earlier work, we defined a taxonomy for these dimension updates and a minimal set of operators to perform them. In this paper we present TSOLAP, an OLAP server supporting fully dynamic dimensions. TSOLAP conforms to the OLE...
متن کاملRoK: Roll-Up with the K-Means Clustering Method for Recommending OLAP Queries
Dimension hierarchies represent a substantial part of the data warehouse model. Indeed they allow decision makers to examine data at different levels of detail with On-Line Analytical Processing (OLAP) operators such as drill-down and roll-up. The granularity levels which compose a dimension hierarchy are usually fixed during the design step of the data warehouse, according to the identified an...
متن کاملA Temporal Query Language for OLAP: Implementation and a Case Study
Commercial OLAP systems usually treat OLAP dimensions as static entities. In practice, dimension updates are often necessary in order to adapt the multidimensional database to changing requirements. In earlier work we proposed a temporal multidimensional model and TOLAP, a query language supporting it, accounting for dimension updates and schema evolution at a high level of abstraction. In this...
متن کاملDimension Updates and Hierarchy Maintenance in OLAP Database
OLAP systems are designed by implementing the multidimensional data model. Data facts in the model are viewed as points in a space of application-related dimensions. Usually they are organized in levels which conform to a hierarchy. OLAP database facilitates a special class of queries performed on large collection of integrated data that support the decision making process. Central issue in dat...
متن کاملMaintaining Data Cubes under Dimension Updates
OLAP systems support data analysis through a mul-tidimensional data model, according to which data facts are viewed as points in a space of application-related \dimensions", organized into levels which conform a hierarchy. The usual assumption is that the data points reeect the dynamic aspect of the data warehouse, while dimensions are relatively static. However, in practice, dimension updates ...
متن کامل